12 research outputs found

    Effectiveness of machine learning and deep learning models at county-level soybean yield forecasting

    Get PDF
    Crop yield forecasting is critical in modern agriculture to ensure food security, economic stability, and effective resource management. The main goal of this study was to combine historical multisource satellite and environmental datasets with a deep learning (DL) model for soybean yield forecasting in the United States’ Corn Belt. The following Moderate Resolution Imaging Spectroradiometer (MODIS) products were aggregated at the county level. The crop data layer (CDL) in Google Earth Engine (GEE) was used to mask the data so that only soybean pixels were selected. Several machine learning (ML) models were trained by using 5 years of data from 2012 to 2016: random forest (RF), least absolute shrinkable and selection operator (LASSO) regression, extreme gradient boosting (XGBoost), and decision tree regression (DTR) as well as DL-based one-dimensional convolutional neural network (1D-CNN). The best model was determined by comparing their performances at forecasting the soybean yield in 2017–2021 at the county scale. The RF model outperformed all other ML models with the lowest RMSE of 0.342 t/ha, followed by XGBoost (0.373 t/ha), DTR (0.437 t/ha), and LASSO (0.452 t/ha) regression. However, the 1D-CNN model showed the highest forecasting accuracy for the 2018 growing season with RMSE of 0.280 t/ha. The developed 1D-CNN model has great potential for crop yield forecasting because it effectively captures temporal dependencies and extracts meaningful input features from sequential data

    Sunflower crop yield prediction by advanced statistical modeling using satellite-derived vegetation indices and crop phenology

    No full text
    Timely crop yield information is needed for agricultural land management and food security. We investigated using remote sensing data from the Earth observation mission Sentinel-2 to monitor the crop phenology and predict the crop yield of sunflowers at the field scale. Ten sunflower fields in Mezőhegyes, southeastern Hungary, were monitored in 2021, and the crop yield was measured by a combine harvester. Images from Sentinel-2 were collected throughout the monitoring period, and vegetation indices (VIs) were extracted to monitor the crop growth. Multiple linear regression and two different machine learning approaches were applied to predicting the crop yield, and the best-performing one was selected for further analysis. The results were as follows. The VIs showed the highest correlation with the crop yield (R > 0.6) during the inflorescence emergence stage. The most suitable time for predicting the crop yield was 86–116 days after sowing. Random forest regression (RFR) was the best machine learning approach for predicting field-scale variability of the crop yield (R2 ∼ 0.6 and RMSE 0.284–0.473 t/ha). Our results can be used to develop a timely and robust prediction method for sunflower crop yields at the field scale to support decision-making by policymakers regarding food security

    Comparison of PlanetScope, Sentinel-2, and landsat 8 data in soybean yield estimation within-field variability with random forest regression

    No full text
    Accurate timely and early-season crop yield estimation within the field variability is important for precision farming and sustainable management applications. Therefore, the ability to estimate the within-field variability of grain yield is crucial for ensuring food security worldwide, especially under climate change. Several Earth observation systems have thus been developed to monitor crops and predict yields. Despite this, new research is required to combine multiplatform data integration, advancements in satellite technologies, data processing, and the application of this discipline to agricultural practices. This study provides further developments in soybean yield estimation by comparing multisource satellite data from PlanetScope (PS), Sentinel-2 (S2), and Landsat 8 (L8) and introducing topographic and meteorological variables. Herein, a new method of combining soybean yield, global positioning systems, harvester data, climate, topographic variables, and remote sensing images has been demonstrated. Soybean yield shape points were obtained from a combine-harvester-installed GPS and yield monitoring system from seven fields over the 2021 season. The yield estimation models were trained and validated using random forest, and four vegetation indices were tested. The result showed that soybean yield can be accurately predicted at 3-, 10-, and 30-m resolutions with mean absolute error (MAE) value of 0.091 t/ha for PS, 0.118 t/ha for S2, and 0.120 t/ha for L8 data (root mean square error (RMSE) of 0.111, 0.076). The combination of the environmental data with the original bands provided further improvements and an accurate yield estimation model within the soybean yield variability with MAE of 0.082 t/ha for PS, 0.097 t/ha for S2, and 0.109 t/ha for L8 (RMSE of 0.094, 0.069, and 0.108 t/ha). The results showed that the optimal date to predict the soybean yield within the field scale was approximately 60 or 70 days before harvesting periods during the beginning bloom stage. The developed model can be applied for other crops and locations when suitable training yield data, which are critical for precision farming, are available
    corecore